Tag

#ai optimization

6 articles

Hot French startup ZML releases free product to speed inference across lots of AI chips

Learn how to use ZML's open-source inference optimization software to accelerate AI model execution across multiple hardware platforms, demonstrating performance improvements through practical implementation.

Jul 744

Qualcomm Buys Buzzy Chip Startup Modular for Nearly $4 Billion

Learn how to optimize AI models for edge deployment using quantization techniques similar to those developed by Modular AI, which Qualcomm recently acquired for nearly $4 billion.

Jun 2448

28 Tips to Take Your ChatGPT Prompts to the Next Level

Learn how advanced prompt engineering techniques can dramatically improve AI model performance by strategically designing input prompts to guide large language models toward desired outputs.

Jun 2149

How to Build Memory-Efficient Transformers with xFormers Using Packed Sequences, GQA, ALiBi, SwiGLU, and Causal Attention

Learn how xFormers helps make AI models faster and more memory-efficient by optimizing how they process text data.

Jun 1635

Google speeds up Gemma 4 threefold with multi-token prediction

Learn how to implement multi-token prediction for text generation using Google's Gemma 4 model, demonstrating how generating multiple tokens simultaneously can speed up text generation by up to three times.

May 675

tech

iPhone 17e vs. iPhone 17: I compared the two models to decide which has the better value

This explainer explores how Apple's neural engine optimization enables efficient AI processing in mobile devices, comparing the iPhone 17e's approach to the base iPhone 17 model.

Mar 4188